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Abstract 

In this article we discuss six degrees of separation, which has been suggested by Milgram's 
famous experiment [I], [2], from a theoretical point of view again. Though Milgram's experi- 
ment was partly inspired to Pool and Kochen's study [4] that was made from a theoretical 
point of view. At the time numerically detailed study could not be made because comput- 
ers and important concepts, such as the clustering coefficient, needed for a network analysis 
' nowadays, have not yet developed. In this article we devote deep study to the six degrees of 

separation based on some models proposed by Pool and Kochen by using a computer, numer- 
q ■ ically. Moreover we estimate the clustering coefficient along the method developed by us [7J 

and extend our analysis of the subject through marrying Pool and Kochen's models to our 
CZ2 , method. 
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In 1967, Milgram made a great impact on the world by advocating the concept "six degrees 
of separation" by a celebrated paper pQ written based on an social experiment. "Six degrees of 
separation" shows that people have a narrow circle of acquaintances. A series of social experiments 
made by him and his joint researcher[2] suggest that all people in USA are connected through about 
6 intermediate acquaintances. Their studies were strongly inspired by Pool and Kochen's study 
[4]. At the time, however, numerically detailed study [4] could not be made because computers 
and important concepts, such as the clustering coefficient, needed for a network analysis nowadays, 
' have not yet developed sufficiently. 

One of the most refined models of six degrees of separation was formulated in work of Watts and 
Strogatz 5 ,[6J. Their framework provided compelling evidence that the small- world phenomenon 
is pervasive in a range of networks arising in nature and technology, and a fundamental ingredient 
in the evolution of the World Wide Web. But they do not examine closely Milgram's original 
findings by their model:, especially how influence can the clustering coefficient proposed in their 
paper [5] have. We have made a study of them in our previous paper [7J based on a homogeneous 
hypothesis on networks. As a result, we found that the clustering coefficient has not any decisive 
effect on the propagation of information on a network and then information easily spread to a lot 
of people even in the cases with a relatively large clustering coefficient; a person only needs dozens 
of friends. 

In this article we devote deep study to the six degrees of separation based on some models 
proposed by Pool and Kochen [4] by using a computer, numerically. Moreover we estimate the 
clustering coefficient along the method developed by us [7J and extend our analysis of the subject 
through marrying Pool and Kochen's models to our method. As a result, it seems to be difficult 
that six degrees of separation is realized in the models proposed by Pool and Kochen[4] on the 
whole. 

The plan of this article is as follows. In the next section we argue on the first idea proposed by 
Pool and Kochen[3], where they impose a hypothesis on the average number rrij of acquaintances 
common to j individuals. In the section we give the condition that information spreads to about 
10 9 people within the parameters introduced in the model and evaluate how many people can 
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receive information released from one person after 6 ~ 10 steps of transmission of information. 
We also argue that the results given by Pool and Kochen are unstable. In section 3, we study 
their improved model. We there investigate the almost same subjects as ones in the section 2. In 
section 4, we apply the propagation coefficient model proposed by us [7] to Pool and Kochen[3] to 
evaluate the clustering coefficient. Then we discuss whether six degrees of separation is feasible in 
Pool and Kochen's models[4]. The section 5 is devoted to summary and consideration. 



2 Pool and Kochen Model 



Some models for patterns of social contacts were described in Pool and Kochen's paper [4]. 
They can be broadly classified into two groups: the model with social strata and the ones without 
social strata in the population considered. We concentrate our discussion on the former cases, 
which are described in the section "The number of common acquaintances" of their paper. There 
mainly two considerations except for trivial notions are, mainly, described. 

N is the total population in the region considered. A unique characteristics in their model is 
to introduce the average number m,j of acquaintances common to j individuals such as illustrated 
in Fig.l. They assumed the following relation between m J+1 and my, 



for j = 1, 2, 3, 4, • • ■ with < a < 1. 



(1) 



This means that for example the average number of acquaintances common to five people is smaller 
than the average number common to four by a factor a, which is the same proportion as the number 
of friends shared by four is to the number shared by three. This a is between and 1 and should 
be statistically estimated. It is also assumed that a is independent of j 
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FigDl. A schematic illustration of mj.@@ 

@ 

When rii is the number of independent acquaintances standing at intervals of i link length 
away from a person A (i-th generation from A), we can express Pi, which is the probability that 
acquaintances standing at i-th generation from the person A are just an acquaintance of another 
person B chosen randomly from people except for the acquaintance tree starting from A , as 



3=0 



(2) 



The Fig. 2 shows the situation of Pool and Kochen Model. The following recursion relation holds 



for m, 



TH+i = -{!-(! -aD, 



(3) 



where n is the average number of acquaintances that any person knows (referred as the propagation 
coefficient in our paper T]) and satisfies m± = n. By solving this recursion relation, we can find 
the total population M(d) that receive information propagated from A during d generations; 



M(d) = Y J n k- 
fc=i 



Paying attention to m\ = n\ = n, we obtain 



Na 2 



1 - (1 - a)™' 

which leads to a useful relation a and the propagation coefficient n. 



(4) 
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FigD2. A schematic illustration of Pool and Kochen Model. @@ 
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Fig. 3 shows the numerical relation of Eq. (4) for N — 10 9 . The value of N comes from 
a slightly large population of the USA. From this, in order to obtain a realistic value of n for 
N = 10 9 , extremely small values of a are needed such as a ~ O(10~ 6 ) when n ~ O(10), and 
a ~ O(10" 4 ) when n ~ O(10 2 ~ 3 ). We refer to Bernard et al. [3[9l[T0], where they estimate that 
the average person has a social circle of about 290 people from empirical studies. We infer a ~ (a 
few) x 10 -4 from that value. 

From these results, evaluating the total propagation population M under appropriate values of 
a and n, we obtain the following results; M = 4 x 10 5 for a ~ 10~ 7 and n = 10, and M = 6 x 10 6 
for a = 10 -3 and n = 1346. This means that M evaluated is insufficient for information even to 
spread to only one percent of the total population even in the case with M ~ 6 x 10 6 . Since n, 
rapidly converges to a constant when generation i grows into about 3 in the case of n — 1349, such 
as shown in Fig. 4, M grows larger in proportion to generation number d for d > 3. This is a 
reason that information can not spread over a large population. 
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@ Poor and Kochen mainly gave analyses of the relation between Pj and m- Analyzing numerically 
it in some detail, we find that the relation so unstable that it is difficult to get reliable claims. As 
shown in Fig. 5, m rapidly grows around P2 — 1/3 and that is to say, P2 rapidly grows for small 
changing of n\. Though the relation between Pj and n, is unreliable around there, the relation 
between a and n is stable as shown in Fig. 3. Thus the estimation of M calculated above is also 
reliable. 



3 Version up Model 

As the second step, Poor and Kochen developed their model more fully. There they introduce 
a set K A of A's circle of acquaintances and its complements^ . Ai denote the individuals in the 
set K A . The following assumption are made on the conditional probability Prob(B e K Ak \B S 



Prob{B G K Ak \B e K Ah _ x ,B& K Ak 



B eK Al ) = Prob{K Ak \K Ak _ x ) = b = const. (5) 



where B is a person randomly chosen and the constant b should be statistically estimated. Thus 
we get [4] 

Prob(K Ak ,K Ak _ x ,--- ,K Al )=Prob(K Al )b k - 1 = (1 



N 



)b k 



fc-i 



Since for k = 2 



so we have 
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Prob{K A2 ,K Al ) = (l--)b = l- — 
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From these equations, we get [4] 

n fc+1 = ^{l-(l-^r4, 

TO2 ^ 77 J 

fe-1 

p k = i[{i-Pi}Pi, 



i=0 
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(6) 
(7) 
(8) 

(9) 
(10) 
(11) 



By using these relations, Poor and Kochen mainly studied about P/j, but did not give no 
consideration to M. In order to do it we only need to solve the recursion relation (9). In this 
article we numerically estimate M for N = 10 9 , changing values of n and 7772,. The results are 
partly given by Fig. 6. As expected, M increases as 77 becomes larger. We also find a natural result 
that the more larger 7772 is, the smaller M is. It is, however, impossible that information does 
spread to most of the total population even after 10-th generation in the both cases of Fig. 6. 
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Fig. 6. Total population M for n = 200 (left) and 77 = 1000 generation (right) in 10-th. 
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4 Poor-Kochen Model and Clustering Coefficient 



In this section we calculate the clustering coefficients in Poor-Kochen Model according to the 
general method developed in the propagation model [TJ. First of all, we tidy the notations used in 
the model. 

m is the number of nodes in i-th generation G{. 

N is the number of total nodes or the size of a network. 

C is the clustering coefficient of a network. 

Ci is the contribution to the clustering coefficient produced in G;. 
ki t i is the number of edges connected between the same generation i. 
fcii+i the number of edges from a node j in Gi to nodes of Gj+i- 
fcj^+i is the average of fej^+i over au n °des relevant to the generation; 
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Using these quantities, we can express the average degree K of a network. 

Ikj 



K = 1 



k 



(12) 



(13) 



In the propagation model we assumed that C, was constant, but we now should assume that the 
population m in each generation is constant in Pool-Kochen model. So assuming that the recursion 
relation on n, satisfies 



we have 



ki,i+irii(l - q) = const. 
1 



H,i+1 



(14) 



(15) 



where q denotes the probability that a node has two acquaintances in the generation earlier than 
the node. A schematic diagram of the propagation model including the parameter q is given by 
Fig. 7. Substituting these expressions into the following equation (16) given by us[7j 



Ci = 



ki-i,irii-i { (ki-i,i - 1)(K - 1 - 2q(K - 1 - 



K(K - 1) 



ki—\,%yii—\ 1 



(16) 



We obtain 



CM) 



nq 



(1 - q) 2 K(K - l)(n - 1 - q)(n - 1) 



(n - 1 ) (K - qK - 1 ) + 2{K - qK + q - 2) (n - 1 + q) 



(17) 

When varying q, the behavior of Ci in the propagation model is shown in Fig. 8 where K — 200 
and h = 1000 are taken. This value will be proper for inhomogeneous networks with respect to 
degree distribution , which is actually assumed in this section. For larger values of K and n such as 
K = 1000 and n = 150000, we have checked that Ci increases more rapidly as q grows larger. Thus 
the clustering coefficient does not grow large unless q considerably becomes large. It is thought 
to be difficult that Pool-Kochen model can realize any small worlds from the perspective of this 
analysis, too. So far networks based on Pool-Kochen models do display a large world property and 
small clustering coefficient is preferable. 
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Fig 7. A schematic diagram of the propagation model. 
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Fig 8. q-Ci plot for K = 200 and n = 1000. 



Pool and Kochen have made some discussions on models with social strata. Since it is, however, 
though that the models do not bring any correct results, we abandon the pursuit of the models. 



5 Summary and Consideration 

In this article we numerically analyzed how six degrees of separation can be realized in human 
networks based on a series of Pool-Kochen models. Moreover we estimate the clustering coefficient 
of Pool-Kochen models according to the propagation model and explored the possibility of small- 
worldness. 

In result, we found that it is difficult that Pool-Kochen models realize six degrees of separation 
and also achieve a large clustering coefficient. Recently Kleinfield has fanned some critical discus- 
sions to Milgram's empirical evidence for six degrees of separation [11] . Later Watts ea al. have 
conducted by far the largest ever small-world experiment by using E-mail, involving 60 thousand 
E-mail users with targets over 13 countries [THQ2]. As they recognize, their experiment has a pos- 
itive bias in the choice of E-mail users. The small world problem, however, remains as fascinating 
psychological mysteries. In my opinion, the meaning of Pool-Kochen's study is that the models 
will not do much a better understanding of six degrees of separation but inspired Milgram and so 
on to study interesting subjects such as six degrees of separation. 

Once we discussed it based on homogeneous hypothesis in the propagation model [7] and find 
that six degrees of separation likely to materialize somewhat. The hypothesis is, however, no 
correct. There is considerably deviation in the degree distribution, the clustering coefficient and so 
on in real networks. To understand six degrees of separation more really, we should introduce some 
correct distributions into the degree distribution, the clustering coefficient and so on. Newman 
[14j has made discussions on it by considering some distribution in the degree distribution and 
furthermore "mutuality" which is a quantity that reflects the density of squares in human relations. 
This direction of study seems to play an important role in the studies of six degrees of separation. 
The detail research toward the line of this should be made more properly. 
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